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© Layer encoding/decoding apparatus for input non-interlace video signal. 



© In an encoder (1, 2; Fig. 1), an input picture of a 
non-interlace structure is sampled by down sampling 
to form an interlace picture.and the interlace picture 
is encoded. The encoded data are decoded to form 
^ an interlace decoded picture, and the interlace de- 
O coded picture is sampled by up sampling and uti- 
^1 lized as a predictive picture upon encoding of the 
™ non-interlace input picture. In a decoder (7, 4; Fig. 
CO 

o> 
in 



a. 

UJ 



2) t a bit stream corresponding to an interlace picture 
of a low resolution is decoded into an interlace 
decoded picture. The interlace decoded picture is 
sampled by up sampling and is utilized as a predic- 
tive picture upon decoding of a bit stream cor- 
responding to a non-interlace picture of a high reso- 
lution. 
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FIELD OF THE INVENTION 

This invention relates to an encoding and/or 
decoding apparatus suitable for use with informa- 
tion recording and/or reproduction apparatus in 
which a storage type moving picture medium such 
as an optical disk or a magnetic tape is used and 
information transmission and/or reception apparatus 
such as television conference systems, moving pic- 
ture telephone systems and broadcasting equip- 
ments. 

BACKGROUND OF THE INVENTION 

One of moving picture encoding and/or decod- 
ing systems wherein a spatial resolution between 
an input picture and an output picture is varied is a 
layer encoding/decoding system investigated in the 
ISO-IEC/JTCI/SC29/WGII. In the layer encod- 
ing/decoding system, basically on the encoder 
side, an input image of a low resolution is encoded 
in a lower layer to form a bit stream, and a de- 
coded picture of the bit stream of the lower layer is 
converted by up conversion and used for encoding 
of an other input picture of an upper layer having a 
high resolution. On the other hand, on the decoder 
side, the decoded picture of the lower layer is 
converted by up conversion and used for decoding 
of a bit stream of the upper layer. 

A detailed example of such layer encod- 
ing/decoding systems is disclosed in Proposal for 
Test Model 2, ISO-IEC/JTCI/SC29/WGII, revision 2, 
pp.101 -108, October 19 t 1992. 

The layer encoding/decoding system, however, 
has a subject to be solved in that, since it is 
constructed so as to cope only with the case 
wherein the input is an interlace picture, it cannot 
cope with another case wherein the input is a non- 
interlace image. 

OBJECT AND SUMMARY OF THE INVENTION 

It is an object of the present invention to pro- 
vide an encoding apparatus and a decoding ap- 
paratus which realize layer encoding and decoding 
of a non-interlace input image, respectively. 

In according with the present invention, an in- 
terlace image is formed from an input non-interlace 
image. The interlace image is encoded to form a 
first encoded signal. A non-interlace image is 
formed from the first encoded signal. The input 
non-interlace image is encoded using the non-inter- 
lace image. 

Further, in accordance with the present inven- 
tion, a decoded interlace image is formed from a 
first input bit stream. A first decoded non-interlace 
image is formed from the decoded interlace image. 
A second decoded non-interlace image is formed 



from the first decoded non-interlace image and a 
second input bit stream. 

The above and other objects, features and ad- 
vantages of the present invention will be apparent 
s from the following detailed description of the pre- 
ferred embodiment when read in conjunction with 
the accompanying drawings in which like reference 
numerals are used to identify the same or similar 
parts in the several views. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram showing the construc- 
tion of ah encoder wherein an upper layer has a 

75 non-interlace structure; 

FIG. 2 is a block diagram showing the construc- 
tion of a decoder wherein an upper layer has a 
. non-interlace structure; 

FIG. 3 is a diagrammatic view showing the con- 

20 struction of a down sampling circuit; 

FIG. 4 is a diagrammatic representation illustrat- 
ing down sampling operation; 
FIG. 5 is a diagrammatic view showing the con- 
struction of an up sampling circuit; 

25 FIG. 6 is a diagrammatic representation illustrat- 
ing up sampling operation; and 
FIG. 7 is a block diagram showing an apparatus 
for recording a bit stream. 

30 DETAILED DESCRIPTION OF THE INVENTION 

In the following, a system will be described 
wherein, for example, a non-interlace image is in- 
putted and then decoded with the ratio of 2:1 

35 between the numbers of scanning lines of an upper 
layer and a lower layer to obtain an output bit 
stream which has compatibility with both of a non- 
interlace image and an interlace image. 

An encoding system or encoder according to 

40 the present invention will be described with refer- 
ence to FIG. 1 . First, a lower layer encoder block 2 
will be described. 

In the encoder, processing is performed in 
units of a macro block. An input video signal S21 

45 of a high resolution having a non-interlace structure 
is inputted to a down sampling circuit 22, in which 
it is converted into an interlace image of a low 
resolution. Upon such conversion, a motion vector 
from a motion estimator 37 is referred to to adap- 

so tively determine whether or not the input video 
signal S21 should be passed through a low-pass 
filter upon sampling of the data. Here, the motion 
vector may be supplied from a motion estimator 24 
which will be hereinafter described. In this instance, 

55 however, since a motion vector of an image a little 
prior in time is used, the encoding efficiency on the 
upper layer side sometimes exhibits a rather low 
level. Details of the down sampling circuit 22 will 
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be hereinafter described. 

An interlace picture of a block format sampled 
by down sampling by the down sampling circuit 22 
is stored into a frame memory set 23 and used for 
detection of a motion vector by the motion estima- 
tor 24. The motion estimator 24 detects a motion 
vector from a reference picture at present using a 
forward predictive picture and a backward predic- 
tive picture from among pictures stored in the 
frame memory set 23. Here, in detection of a 
motion vector, one of motion vectors with which the 
absolute value sum of differences between fields or 
frames in units of a block is determined as a 
motion vector. 

Information of the motion vector is transmitted 
to a motion compensation frame memory set 25, in 
which selection of a macro block type is formed. 
Then, a predictive picture corresponding to the 
thus selected macro block type is sent to a dif- 
ference unit 26. The difference unit 26 calculates a 
difference between the input picture and the pre- 
dictive picture and outputs the difference to a dis- 
crete cosine transform (DCT) circuit 27. 

The DCT circuit 27 transforms the input picture 
data or the difference data in units of a block by 
discrete cosine transform making use of a two- 
dimensionai correlation of a video signal and out- 
puts transform data obtained by the discrete cosine 
transform to a quantization circuit 28. 

The quantization circuit 28 quantizes the DCT 
transform data with a quantization step size, which 
is determined for each macro block or siice, and 
supplies quantization data obtained at an output 
terminal thereof as a result of the quantization to a 
variable length coding (VLC) circuit 29 and a de- 
quantization circuit 30. 

The VLC circuit 29 processes the quantization 
data by variable length coding processing together 
with the quantization scale, the macro block type, 
the motion vector and so forth and outputs a lower 
layer bit stream S31 obtained by the variable 
length coding as transmission data. 

The dequantization circuit 30 dequantizes the 
quantization data sent out from the quantization 
circuit 28 into dequantization data and decodes the 
transfor m data of the output data before the trans- 
formation by the quantization circuit. Then, the 
dequantization circuit 30 supplies the dequantiza- 
tion data to an inverse discrete cosine transform 
(IDCT) circuit 32. 

The IDCT circuit 32 transforms the dequan- 
tization data decoded by the dequantization circuit 
30 back into decoded picture data by inverse trans- 
form processing to that of the DCT circuit and 
outputs the decoded picture data to the motion 
compensation frame memory set 25. 

The motion compensation frame memory set 
25 performs motion compensation based on the 
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motion vector supplied thereto from the motion 
estimator 24 and writes the decoded picture as a 
forward predictive picture or a backward predictive 
picture into the frame memories. In the case of 

5 forward, backward or bidirectional prediction, the 
difference from the predictive picture is transmitted 
as an output of the iDCT circuit 32 to the motion 
compensation frame memory set 25, and accord- 
ingly, local decoding is performed by adding the 

w difference to the predictive picture using an adder 
33. 

The predictive picture here is identical with the 
picture decoded by the lower layer decoder, and 
forward, backward or bidirectional prediction is per- 

1$ formed for a picture for next processing based on 
the predictive picture. 

The locally decoded picture outputted from the 
adder 33 is supplied also to an up sampling circuit 
34. While the locally decoded picture of the lower 

20 layer has an interlace structure, interpolation is 
performed for the locally decoded picture by the 
up sampling circuit 34 so that the locally decoded 
picture is converted into a non-interlace picture 
similarly to that of the upper layer, and the non- 
25 interlace picture is inputted to a spatio-temporal 
predictor 39 of the upper layer. 

The up sampling circuit 34 adaptiveiy refers to 
both of a picture from the upper layer which in- 
cludes same spatial sample points with a full de- 

30 gree of accuracy but are different in temporal sam- 
ple points and another picture from the lower layer 
which has a lower resolution due to sampling but 
has same temporal sampling points based on the 
motion information to produce a non-interlace inter- 

35 polation picture. Details of the up sampling circuit 
34 will be hereinafter described. 

Subsequently, the construction of the upper 
layer will be described. In the upper layer encoder 
block 1, the input non-interlace video signal S21 is 

40 processed while remaining in the full degree of 
accuracy. The input non-interlace video signal S21 
is delayed by a deiay circuit 35 and then stored 
into a frame memory 36, whereafter detection of a 
motion vector is performed for the input non-inter- 

45 lace video signal S21 by a motion estimator 37. 
The motion estimator 37 operates similarly to the 
motion estimator 34. 

The information of the motion vector is sent to 
a motion compensation frame memory set 38, and 

50 a corresponding predictive picture is inputted to a 
spatio-temporal predictor 3d. The spatio-temporal 
predictor 39 adaptiveiy selects, by weighting, the 
predictive picture from the motion compensation 
frame memory set 38, that is, a predictive picture 

55 from the temporal direction, and the predictive pic- 
ture from the up sampiing circuit 34, that is, an- 
other predictive picture from the spatial direction. 
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The spatio-temporal predictor 39 determines a 
coefficient a based on the motion information from 
the motion estimator 37, applies weighting of 1 - a 
to the predictive picture in the temporal direction, 
that is, the predictive picture of the upper layer and 
applies weighting of a to the predictive picture in 
the spatial direction, that is, the predictive picture 
from the lower layer, and then sends the thus 
weighted predictive pictures to a cumulative dif- 
ference unit 40. 

The difference unit 40 calculates a difference 
between the input picture and the predictive pic- 
tures and outputs the difference to a discrete co- 
sine transform circuit (DCT) 41. 

The DCT circuit 41 transforms the input picture 
data or the difference data by discrete cosine 
transform making use of a two-dimensional correc- 
tion of a video signal and outputs transform data 
obtained by the discrete cosine transform to a 
quantization circuit 42. 

The quantization circuit 42 quantizes the DCT 
transform data with a quantization step size, which 
is determined for each macro block or slice, and 
supplies quantization data obtained at an output 
terminal thereof as a result of the quantization to a 
variable length coding (VLC) circuit 43 and a de- 
quantization circuit 44. 

The VLC circuit 43 processes the quantization 
data by variable length coding processing together 
with the quantization scale, the macro block type, 
the motion vector and so forth and outputs an 
upper layer bit stream S45 obtained by the variable 
length coding as transmission data. 

The quantization circuit 44 quantizes the quan- 
tization data sent out from the quantization circuit 
42 into a representative value to obtain dequan- 
tization data and decodes the transform data back 
of the output data before the transformation by the 
quantization circuit 42. Then, the dequantization 
circuit 44 supplies the dequantization data to an 
inverse discrete cosine transform (IDCT) circuit 46. 

The IDCT circuit 46 transforms the dequan- 
tization data decoded by the dequantization circuit 
44 back into decoded picture data by inverse trans- 
form processing to that of the DCT circuit 41 and 
outputs the decoded picture data to the motion 
compensation frame memory set 38. 

The motion compensation frame memory set 
38 performs local decoding based on the motion 
vector supplied thereto from the motion estimator 
24, the, macro block type and the rnptiori. vector, 
and writes the locally decoded picture as a forward 
predictive picture or a backward predictive picture 
into the frame memories. In the, case of forward or 
bidirectional prediction, the difference from the pre- 
dictive picture is transmitted as an output of the 
IDCT circuit 46 to the motion compensation frame 
memory set 38, and accordingly, local decoding is 
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performed by adding the difference to the predic- 
tive picture using an adder 47. 

The predictive picture here is identical with the 
picture decoded by the upper layer decoder, and 

5 forward or bidirectional prediction is performed for 
a picture for next, processing based on the predic- 
tive picture 

FIG. 3 shows the .construction of the down 
sampling circuit ,22. An input picture is inputted to 

10 a low-pass filter 71, and a switch 73 adaptively 
selects the input picture or the output picture of the 
low-pass filter 71 in response to a motion vector. 
The output of the switch 73 is alternately sampled 
by a sampling circuit 72. Since the picture after 

75 such sampling includes reflected components, nor- 
mally it is acted upon uniformly by the low-pass 
filter 71 , but due to the characteristic that the input 
is a non-interlace video signal, when the motion is 
small, sample points which are identical in spatial 

20 positions with but different in temporal positions 
from thinned out points are present without fail. 

FIG. 4 illustrates operation of the down sam- 
pling circuit 22. Referring to FIG. 4, two columns of 
dots respectively indicate the positions of pixels 

25 arranged in the vertical direction in a single picture. 
The down sampling circuit 22 thins out those pixels 
at alternate positions from two pictures while sam- 
pling only those pixels surrounded by squares ,in 
FIG. 4 to convert the input non-interlace video 

30 signal into an interlace picture. In the case of, for 
example, a still picture, the two points surrounded 
by a dotted line in FIG. 4 represent identical data, 
and accordingly, filtering processing is unneces- 
sary as described above. 

35 FIG. 5 shows the construction of the up sam- 

pling circuit 34. the input to the up sampling circuit 
34 includes data of the upper layer which are at the . 
same spatial positions but are at different temporal 
positions, data interpolated from sample points of 

40 the same field of the lower layer, and data at the 
corresponding points from a different field of the 
lower layer. Weighting interpolation is thus per- 
formed for the inputs using coefficients wl, w2 and 
w3 which are adaptively determined by a coeffi- 

45 cient generator 81 1 . 

FIG. 6 illustrates operation of the up sampling 
circuit 34. Referring to FIG. 6, two left side columns 
of dots respectively indicate the positions of pixels 
arranged in a vertical direction in. a single picture of 

so the upper layer. Meanwhile, the other two right side 
columns of dots respectively indicate the positions 
of pixels arranged in a vertical direction in a single 
picture of the lower layer. As seen from FIG. 6, the 
up sampling circuit 34 performs, for a pixel for 

55 which interpolation should be performed, interpola- 
tion from adjacent pixels in the same field of the 
lower layer and weighting interpolation from a pixel 
of the other field of the lower layer and another 

5 
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pixel at the same spatial position of the upper layer 
using the coefficients of w1 and w2, respectively. 
Further, the up sampling circuit 34 adds the inter- 
polation values obtained using the coefficients of 
wl and w2 and the value at the same position of 
the lower layer, that is, the value of the point for 
which interpolation should be performed, with the 
weight of the coefficient of w3. For example, when 
the pixel for which interpolation should be per- 
formed is at the position of 205, the up sampling 
circuit 34 performs interpolation from the adjacent 
pixels 201 and 202 in the same field of the lower 
layer and weighted interpolation from the pixel 203 
at the same position in the other field of the lower 
layer and the pixel 204 at the same spatiaJ position 
of the upper layer using the coefficients of w1 and 
w2. In this instance, the pixel 205 at the same 
position of the lower layer referred to with the 
coefficient of 1 - w3 is a point for which interpola- 
tion of zero has been performed, and since the 
value is 0, it does not have an influence upon the 
interpolation. On the other hand, when the pixel for 
which interpolation should be performed is at the 
position of 206. that is, when the pixel 206 of the 
lower layer which is not equal to zero is present at 
the same position for which interpolation should be 
performed, since the points referred to with the 
coefficients of 1 - w1 and 1 - w2 are points for 
which interpolation of zero has been performed, 
they have no influence upon the interpolation. Both 
when the pixel for which interpolation should be 
performed is at the position of 205 and when the 
pixel for which interpolation should be performed is 
at the position of 206, a similar circuit configuration 
is employed for the processing. 

If the coefficient w1 is set to 0 upon interpola- 
tion, then a value at the point for which interpola- 
tion shouid be performed can be produced using 
values only in the lower layer without using any 
value in the upper layer. Further, the down sam- 
pling circuit 22 and the up sampling circuit 34 can 
be adapted to conversion of a resolution at some 
other ratio than sampling or interpolation of 2:1 by 
suitably selecting an interpolation filter and sample 
points. 

FIG. 7 shows an example of a recording ap- 
paratus for a lower layer bit stream and an upper 
layer bit stream. Referring to FIG. 7, an upper layer 
bit stream S45 outputted from the upper layer- 
encoder 1 and a lower layer bit stream Sol output- 
ted from the lower layer encoder 2 are multiplexed, 
after passing buffers 101 and 102, respectiveiy, by 
a multiplexing circuit 104 with an audio signal, 
which has been encoded by an audio encoder 100 
and passed through a buffer 103, and a synchro- 
nizing signal not shown. A multiplexed bit stream 
outputted from the multiplexing circuit 104 is pro- 
cessed by addition processing of a code for error 



correction by an error correction code addition 
circuit 105, and then processed by predetermined 
modulation by a modulator 106, whereafter it is 
recorded as concave and convex pits on a master 

5 disk 107 by means of a laser beam. A stamper is 
formed making use of the master disk 107, and a 
large number of copy disks, such as, for example, 
optica! disks, are formed using the stamper. 

Subsequently, the decoding system (decoder) 

w of the present invention will be described with 
reference to FIG. 2. First, a lower layer decoder 
block 4 will be described. A lower layer bit stream 
S51 is inputted to the decoder of the lower layer by 
way of a transmission medium such as an optical 

75 disk. The bit stream S51 is inputted to a variable 
length decoding (IVLC) circuit 52. The variable 
length decoding circuit 52 decodes the bit stream 
S51 and outputs quantization data, a motion vector, 
a macro block type, a predictive mode, a quantiza- 

20 tion scale and so forth. 

A dequantization circuit 3 dequantizes the 
quantization data outputted from the variable length 
decoding circuit 52 into a representative value 
based on the quantization scale to obtain dequan- 

25 tization data and decodes the transform data before 
the transformation by the quantization circuit of the 
encoder. Then, the dequantization circuit 53 sup- 
plies the dequantization data to an inverse discrete 
cosine transform (iDCT) circuit 54. 

30 The IDCT circuit 54 transforms the dequan- 

tization data decoded by the dequantization circuit 
53 back into decoded picture data by inverse trans- 
form processing to that of the DCT circuit 27 and 
outputs the decoded picture data to a motion com- 

35 pensation frame memory set 55. Further, the de- 
coded picture data having an interlace structure is 
simultaneously outputted as a lower layer output to 
an up sampling circuit 57. 

The motion compensation frame memory sat 

40 55 performs decoding based on the output data of 
the IDCT circuit 54, the rnacroblock type and the 
motion vector and writes the decoded picture as a 
forward predictive picture or a backward predictive 
picture into the frame memories, in the case of 

45 forward, backward or bidirectional prediction, a dif- 
ference from a predictive picture is transmitted as 
the output of the IDCT circuit 54 to the motion 
compensation frame memory set 55, and accord- 
ingly, decoding is performed by adding the dif- 

50 ference to the predictive picture using an adder 56 
to form a lower layer picture S66 having an inter- 
lace structure as a decoded picture. 

The predictive picture here is identical with the 
picture decoded by the local decoder of the en- 

55 coder, and forward, backward or bidirectional pre- 
diction is performed for a picture for next process* 
ing based on the predictive picture. 
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Simultaneously, the output picture S66 of the 
lower layer is inputted to the, up sampling circuit 
57, in which it is processed by interpolation so that 
it is converted into a non-interlace picture. The 
non-interlace picture is inputted to a spatio-tempo- 5 
ral predictor 58. 

The up sampling circuit 47 operates similarly 
to the up sampling circuit 34 described 
hereinabove in connection with the encoder. 

Subsequently, the upper layer will . be de- to 
scribed. An upper layer input bit stream 59 is 
inputted to the upper layer. The bit stream 59 is 
inputted to a variable length decoding (IVLC) circuit 
60. The variable length decoding circuit 60 de- 
codes the bit stream 59 and outputs quantization 75 
data, a motion vector, a macrotfiock type and a 
quantization scale. 

A dequantization circuit 61 dequantizes the 
quantization data outputted from the variable length, 
decoding circuit 60 into a representative value 20 
based on the quantization scale to obtain dequan- 
tization data and decodes the transform data before 
the transformation by the quantization circuit of the 
encoder. Then, the dequantization circuit 61 sup- 
plies the dequantization data to an inverse discrete 25 
cosine transform (IDCT) circuit 62. 

The IDCT circuit 62 transforms the dequan- 
tization data decoded by the dequantization circuit 
61 back into decoded picture data by inverse trans- 
form processing to that of the DCT circuit 41 and 30 
outputs the decoded picture data to a motion com- 
pensation frame memory set 63. Further, the , de- 
coded picture data having an interlace structure is 
simultaneously outputted as an upper layer output 
picture S65 having a non-interlace structure. 35 

The motion compensation frame memory set , 
63 performs decoding based on the output data of 
the IDCT circuit 62, the macro block type and : the 
motion vector and writes the decoded picture as a 
forward predictive picture or a backward predictive 40 
picture into the frame memories. 

The picture corresponding to the selected 
macro block type is inputted to the spatio-temporal 
predictor 58. The spatio-temporal predictor 58 op- 
erates similarly to the spatio-temporal predictor 39 45 
of FIG. 1 in the encoder described hereinabove. 

In particular, a picture obtained by adaptively ..... 
weighting a predictive picture of the. upper layer 
which has a spatially high resolution but is different . ; . 
in temporal . sample points and another predictive . 50 
picture of the lower layer which may possibly be. 
inferior in spatial resolution, but is same in temporal . 
sample points is outputted from the spatio-temporal 
predictor 58. In the case of forward, backward or 
bidirectional prediction, since the difference from 55 
the predictive picture is transmitted as the output of 
the IDCT circuit 61, decoding is performed by 
adding the difference to the predictive picture using 



an adder 64. 

As described so far, both of the output of a 
non-interlace picture and the output of an interlace 
picture can be obtained from an upper layer input 
bit stream and a lower layer output bit stream. 

it is to be noted that, while, in the embodiment 
described above, a bit stream outputted from the 
encoder is recorded onto an optical disk, it may . 
otherwise be sent out into a transmission line for 
the ISDN or the communications by satellites. 

While a specific embodiment of the invention 
has been shown and disclosed, it is to be under- 
stood that numerous changes and modifications 
may be made by those skilled in the art without 
departing from the scope and intent of the inven- 
tion. 

Claims 

1. A video signal encoding apparatus, comprising: 

down sampling means (22) for sampling, a 
non-interlace input video signal by down sam- 
pling to form an interlace picture; 

first encoding means (2) for encoding the 
interlace picture to form a first bit stream; 

up sampling means (34) for sampling the. 
video signal based on the first bit stream by up 
sampling to form a non-interlace picture; and . .. 

second encoding means (1) for encoding 
the non-interlace input picture using the non- 
interlace picture. 

2. A video. signal encoding apparatus according, 
to claim 1 , wherein said first encoding means 
(2) includes first conversion means (23-29) for 
performing predetermined conversion of the 
interlace picture, and first local decoding 
means for locally decoding the output of said 
first conversion means (23-29) to form a first 
locally . decoded picture, said up sampling 
means sampling the first locally decoded pic- 
ture by up sampling to form the non-interlace 
picture. . . 

3. A video signal encoding apparatus according ' 
to claim 1, wherein said second encoding 
means (1) includes second conversion means 
(35-43) for performing predetermined con ver- \' 
sion of the non-interlace input signal, secpnd , , 
local decoding means, for locally decoding the ^ 
output of said second conversion means. (35-{ . . . 
43) to form a second locally decoded picture, ^ 
motion compensation means (38, 39) for per- 
forming motion compensation of the second 
locally decoded picture to .form a first predic- 
tive picture, and predictive picture formation 
means for forming a second predictive picture 
based on the non-interlace picture outputted 
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from said up sampling means (34) and the first 
predictive picture. 

4. A video signal encoding apparatus according 
to any of claims 1 to 3, wherein said down 
sampling means (22) includes thinning out 
means for thinning out pixels of the non-inter- 
lace input video signal, and filter means for 
changing over presence or absence of filtering 
to the input to said thinning out means in 
response to motion information. 

5. A video signal encoding apparatus according 
to any of claim 1 to 4, wherein said up sam- 
pling means (34) includes interpolation means 
for performing interpolation based on pixels of 
the first predictive picture at the same posi- 
tions and/or pixels of the first locally decoded 
picture at the same positions of an adjacent 
field. 

6. A video signal encoding apparatus according 
to claim 5, wherein said interpolation means 
includes coefficient generation means (81) for 
generating a predetermined coefficient in re- 
sponse to motion information. 

7. A video signal decoding apparatus, compris- 
ing: 

first decoding means (3) for forming an 
interlace decoded signal from a first input bit 
stream; 

up sampling means (57) for sampling the 
interlace decoded picture by up sampling to 
form a first non-interlace decoded signal; 

second decoding means (4) for forming a 
second non-interlace decoded picture from a 
second bit stream; 

motion compensation means (58) for per- 
forming motion compensation of the second 
non-interlace decoded picture to form a first 
predictive picture; and 

predictive picture formation means (55, 63) 
for forming a second predictive picture for de- 
coding the second non-interlace decoded pic- 
ture based on the first non-interlace decoded 
picture and the first predictive picture. 

8. A video signal decoding apparatus according 
to claim7, wherein said up sampling means 
(57) includes interpolation means for perform- 
ing interpolation based on peels of the first 
predictive picture at the same positions and/or 
pixels of the first locally decoded picture at the 
same positions of an adjacent field. 

9. A video signal decoding apparatus according 
to claim 8, wherein said interpolation means 



includes coefficient generation means (81) for 
generating a predetermined coefficient based 
on motion information. 

5 10. A video signal encoding method, comprising 
the steps of: 

sampling a non-interlace input video signal 
by down sampling to form an interlace picture; 
encoding the interlace picture to form a 
10 first bit stream; 

sampling a signal based on the first bit 
stream by up sampling to form a non-interlace 
picture; and 

encoding the non-interlace input picture 
75 using thie non-interlace picture. 

11. A video signal encoding method according to 
claim 10, wherein the step of encoding the 
interlace picture includes the steps of perform- 

20 ing predetermined conversion of the interlace 

picture to form first coefficient data and, and 
locally decoding the coefficient data to form a 
first locally decoded picture, the signal based 
on the first bit stream being the first locally 

25 decoded picture. 

12. A video signal encoding method according to 
claims 10 or 11, wherein the step of encoding 
the non-interlace input picture includes the 

30 steps of performing predetermined conversion 

of the non-interlace input picture to form sec- 
ond coefficient data, locally decoding the sec- 
ond coefficient data to form a second locally 
decoded picture, performing motion compen- 

35 sation of the second locally decoded picture to 

form a first predictive picture.and forming a 
second predictive picture based on the non- 
interlace picture and the first predictive picture. 

40 13. A video signal encoding method according to 
any of claims 10 to 12, wherein, upon the 
down sampling, presence or absence of filter- 
ing is changed over in response to motion 
compensation. 

45 

14. A video signal encoding method according to 
any of claims 10 to 12, wherein, upon the up 
sampling, interpolation is performed based on 
picture elements of the first predictive picture 

so at the same positions and/or pixels of the first 

locally decoded picture at the same positions 
of an adjacent field. 

15. A video signal encoding method according to 
55 claim 14, wherein the step of interpolation in- 
cludes the step of performing predetermined 
weighting of pixels of the first predictive pic- 
tureat the same positions and/or pixels of the 
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first locally, decoded picture at the same posi- 
tions of an adjacent field in response to motion 
information 

16. A video signal decoding method, comprising 5 
the steps of: 

forming an interlace decoded picture from 
a first input bit stream; 

sampling the interlace decoded picture by 
up sampling to form a first non-interlace de- 10 
coded picture; 

forming a second non-interlace decoded 
picture from a second bit stream; 

performing motion compensation of the 
second non-interlace decoded picture to form is 
a first predictive picture; and 

forming a second predictive picture for de- 
coding the second non-interlace decoded pic- 
ture based on the first interlace decoded pic- 
ture and the first predictive picture. 20 

17. A video signal decoding method according to. 
claim 16, wherein, upon the up sampling, inter- 
polation is performed based on pixels of the 

first predictive picture at the same positions 25 
and/or pixels of the first locally decoded pic- 
ture at the same positions of an adjacent field. 

18. A video signal decoding method according to 
claim 17, wherein the step of interpolation in- 30 
eludes the step of performing predetermined 
weighting of pixels of the first predictive pic- 
ture at the same positions and/or pixels of the 

first locally decoded picture at the same posi- 
tions of an adjacent field. 35 

19. A video signal recording medium, having a first 
bit stream and a second bit stream recorded 
thereon, the first bit stream being formed by 
encoding an interlace picture signal while the aq 
second bit stream is formed by encoding a 
non-interlace picture signal using the first bit 
stream. 
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© In an encoder (1, 2; Fig. 1), an input picture of a 
non-interlace structure is sampled by down sampling 
to form an interlace picture, and the interlace picture 
is encoded. The encoded data are decoded to form 
an interlace decoded picture, and the interlace de- 
coded picture is sampled by up sampling and uti- 
lized as a predictive picture upon encoding of the 
non-interlace input picture, in a decoder (7, 4; Fig. 



2), a bit stream corresponding to an interlace picture 
of a low resolution is decoded into an interlace 
decoded picture. The interlace decoded picture is 
sampled by up sampling and is utilized as a predic- 
tive picture upon decoding of a bit stream cor- 
responding to a non-interlace picture of a high reso- 
lution. 
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